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1 Introduction 

A fundamental topic in Combinatorics on Words is the study of repetitions. A 
repetition in a word is a factor that is formed by the concatenation of two or 
more identical blocks. The simplest kind of repetition is a square, that is the 
concatenation of two copies of the same block, like sciascia. A famous conjecture 
of Fraenkel and Simpson m states that a word of length n contains less than 
n distinct square factors. Experiments strongly suggest that the conjecture is 
true, but a theoretical proof of the conjecture seems difficult. In [10], the authors 
proved a bound of 2 n. In [T2], Hie improved this bound to 2 n — <9(logn), but the 
conjectured bound is still far away. 

Among the different generalizations of the notion of repetition, a prominent 
one is that of an abelian repetition. An abelian repetition in a word is a factor 
that is formed by the concatenation of two or more blocks that have the same 
number of occurrences of each letter in the alphabet. Of course, the simplest kind 
of abelian repetition is an abelian square, that is therefore the concatenation of 
a word with an anagram of itself, like viavai. Abelian squares were considered in 
1961 by Erdos j8], who conjectured that there exist infinite words avoiding abelian 
squares (this conjecture has later been proved to be true, and the smallest possible 
size of an alphabet for which it holds has been proved to be 4 [T5] ). 

We focus on the maximum number of abelian squares that a word can con¬ 
tain. Opposite to case of ordinary squares, a word of length n can contain (9(n 2 ) 
distinct abelian square factors (see [14] )• Since the total number of factors in a 
word of length n is quadratic in n, this means that there exist words in which 
a fixed proportion of all factors are abelian squares. So we turn our attention to 
infinite words, and we wonder whether there exist infinite words such that for 
every n any factor of length n contains, on average, a number of abelian squares 
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that is quadratic in n. We call such an infinite word abelian-square rich. Since a 
random binary word of length n contains 0{n^/n) distinct abelian square factors 
23, the existence of abelian-square rich words is not immediate. We also introduce 
uniformly abelian-square rich words, that are infinite words such that for every 
n, every factor of length n contains a quadratic number of abelian squares. 

As a first result, we prove that the famous Thue-Morse word is uniformly 
abelian-square rich. Then we look at the class of Sturmian words, that are aperi¬ 
odic infinite words with the lowest factor complexity. In this case, we prove that 
if a Sturmian word is /3-power free for some /3 > 2 (that is, does not contain 
repetitions of order /3 or higher), then it is uniformly abelian-square rich. 


2 Notation and Background 

Let E = {«i, 02 ,.. -, a a } be an ordered rr-letter alphabet. Let E* stand for the 
free monoid generated by A, whose elements are called words over E. The length 
of a word w is denoted by |io|. The empty word , denoted by e, is the unique word 
of length zero and is the neutral element of E*. We also define E + = E* \ {e}. 

A prefix (resp. a suffix) of a word w is any word u such that w = uz (resp. w = 
zu ) for some word z. A factor of w is a prefix of a suffix (or, equivalently, a suffix 
of a prefix) of w. . The set of prefixes, suffixes and factors of the word w are 
denoted by Prefiw), Suff(w) and Fact{w ), respectively. From the definitions, we 
have that £ is a prefix, a suffix and a factor of any word. 

For a word w and a letter cq £ E , we let |u>| ai denote the number of occurrences 
of ai in w. The Parikh vector (sometimes called composition vector) of a word 
w over E = {or, o 2 ,..., a a } is the vector P(w) = (|w| ai , \w\ a2 , ..., |tu| a< ,). An 
abelian k-power is a word of the form V\V 2 ■ ■ • Vk where all the vfis have the same 
Parikh vector. An abelian 2-power is called an abelian square. 

An infinite word w over E is an infinite sequence of letters from E , that is, 
a function w : N K > E. Given an infinite word w, the recurrence index R w (n) 
of w is the least integer to (if any exists) such that every factor of w of length 
to contains all factors of w of length n. If the recurrence index is defined for 
every n, the infinite word w is called uniformly recurrent and the function R w (n) 
the recurrence function of w. A uniformly recurrent word w is called linearly 
recurrent if the ratio R w (n)/n is bounded. Given a linearly recurrent word w, the 
real number r w = lim sup^^^ R w {n)/n is called the recurrence quotient of w. 

The factor complexity function of an infinite word w is the integer function 
p w (n) defined by p w {n) = \Fact(w)C\E n \. An infinite word w has linear complexity 
if p w (n) = 0(n). 

A substitution over the alphabet E is a map t : E i—> E + . Using the extension 
to words by concatenation, a substitution can be iterated. Note that for every 
substitution r and every n > 0, r" is again a substitution. Moreover, a substitution 
r over E can be naturally extended to a morphism from E* to E *, since for every 
u,v € E*, one has t(uv) = t(u)t(v), provided that one defines r(e) = e. A 
substitution r is k-uniform if there exists an integer k > 1 such that for all 
a £ E, |r(a)| = k. We say that a substitution is uniform if it is fc-uniform for 
some k > 1. A substitution r is primitive if there exists an integer n > 1 such 
that for every a € E, T n (a) contains every letter of E at least once. In this paper, 
we will only consider primitive substitutions such that r(ai) = a±v for some non¬ 
empty word v. These substitutions always have a fixed point, which is the infinite 
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word w = linin-^oo t"(cii). Moreover, this fixed point is linearly recurrent (see for 
example m and therefore has linear complexity. 

3 Abelian-square Rich Words 

Kociumaka et al. [H] showed that a word of length n can contain a number of 
distinct abelian square factors that is quadratic in n. We give here a proof of this 
fact for the sake of completeness. 

Proposition 1 . A word of length n can contain 0(n 2 ) distinct abelian square 
factors. 

Proof. Consider the word w n = a n ba n ba n , of length 3n + 2. For every 0 < i, j < n 
such that i + j + n is even, the factor a l ba n ba :l of w is an abelian square. Since the 
number of possible choices for the pair ( i,j ) is quadratic in n, we are done. □ 

Motivated by the previous result, we wonder whether there exist infinite words 
such that all their factors contain a number of abelian squares that is quadratic 
in their length. But first, we relax this condition and consider words in which, for 
every sufficiently large n, a factor of length n contains, on average, a number of 
distinct abelian square factors that is quadratic in n. 

Definition 1 . An infinite word w is abelian-square rich if and only if there exists 
a positive constant C such that for every n sufficiently large one has 

— 7 — V {# abelian square factors of v} > Cn 2 . 

p v ,(n) 

v E Fact(w) fl S n 

Notice that Christodoulakis et al. [J proved that a binary word of length 
n contains 0{n^/n) distinct abelian square factors on average, hence an infinite 
binary random word is almost surely not abelian-square rich. 

Given a finite or infinite word w, we let ASF w (n) denote the number of abelian 
square factors of w of length n. Of course, ASF w {n) = 0 if n is odd, so this quantity 
is significant only for even values of n. 

The following lemma is a consequence of the definition of linearly recurrent 
word. 

Lemma 1 . Let w be a linearly recurrent word. If there exists a constant C such 
that for every n sufficiently large one has '^2 m < n ASF w (m) > Cn 2 , then w is 
abelian-square rich. 

In an abelian-square rich word the average number of abelian squares in a 
factor is quadratic in the length of the factor. A stronger condition is that ev¬ 
ery factor contains a quadratic number of abelian squares. We thus introduce 
uniformly abelian-square rich words. 

Definition 2. A n infinite word w is uniformly abelian-square rich if and only if 
there exists a positive constant C such that for every n sufficiently large one has 

inf {# abelian square factors of v} > Cn 2 . 

veFact(w)nz 
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Clearly, if a word is uniformly abelian-square rich, then it is also abelian-square 
rich, but the converse is not always true. However, in the case of linearly recurrent 
words, the two definitions are equivalent, as shown in the next lemma. 

Lemma 2. Ifw is abelian-square rich and linearly recurrent, then it is uniformly 
abelian-square rich. 

Proof. Since w is linearly recurrent, there exists a positive integer K such that 
every factor of w of length Kn contains all the factors of w of length n. Let v be 
a factor of w of length n containing the largest number of abelian squares among 
the factors of w of length n. Hence the number of abelian squares in v is at least 
the average number of abelian squares in a factor of w of length n. Since w is 
abelian square rich, the number of abelian squares in v is greater than or equal 
to Cn 2 , for a positive constant C and n sufficiently large. Since v is contained in 
any factor of w of length Kn , the number of abelian squares in any factor of w of 
length Kn is greater than or equal to Cn 2 , whence the statement follows. □ 

The rest of this section is devoted to prove that the Thue-Morse word and the 
Sturmian words that do not contain arbitrarily large repetitions are uniformly 
abelian-square rich. 

3.1 The Thue-Morse Word 

Let 

t = 011010011001011010010110 • •• 

be the Thue-Morse word, i.e., the fixed point of the uniform substitution // : 0 i—>• 
01,1 i—>■ 10. For every n > 4, the factors of length n of t belong to two disjoint 
sets: those that start only at even positions in t, and those that start only at odd 
positions in t. This is a consequence of the fact that t is overlap-free, hence 0101 
cannot be preceded by 1 nor followed by 0, and that 00 and 11 are not images of 
letters, so they cannot appear at even positions. 

Let p(n) be the factor complexity function of t. It is known [TJ Proposition 4.3], 
that for every n > 1 one has p[2n) = p{n) + p{n + 1) and p(2n + 1) = 2 p(n + 1). 

The next lemma (proved in [3]) shows that the Thue-Morse word has the 
property that for every length there are at least one third of the factors that 
begin and end with the same letter, and at least one third of the factors that 
begin and end with different letters. We define / aa (n ) (resp. f a b(n )) as the number 
of factors of t of length n that begin and end with the same letter (resp. with 
different letters). 

Lemma 3 ([3]). For every n >2, one has f aa {n ) > p(n)/3 and f a b{n) > pin)/ 3. 

Since p(n) > 3(n — 1) for every n [61 Corollary 4.5], we get the following result. 

Corollary 1. For every n > 2, one has f a a{n) > n — 1 and fabin) > n — 1. 

Proposition 2. The Thue-Morse word t is uniformly abelian-square rich. 

Proof. Let u be a factor of length n > 1 of t that begins and ends with the same 
letter. Since the image of any even-length word under p is an abelian square, we 
have that p 2 ( u ) is an abelian square factor of t of length 4n that begins and ends 
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with the same letter. Moreover, the word obtained from p 2 (u) by removing the 
first and the last letter is an abelian square factor of t of length 4 n — 2. So, by 
Corollary [l] t contains at least n — 1 abelian square factors of length 4 n and at 
least n — 1 abelian square factors of length An — 2. This implies that for every even 
n the number of abelian square factors of t of length n is linear in n. Hence, for 
every n the number of abelian square factors of t of length at most n is quadratic 
in n. The statement then follows from Lemmas □ and [2] □ 


3.2 Sturmian Words 

In this section we fix the alphabet £ = {a,b}. 

Recall that a (finite or infinite) word w over £ is balanced if and only if for 
any u,v factors of w of the same length, one has ||u| a — |u| a | < 1. 

We start with a simple lemma. 

Lemma 4. Let w be a finite balanced word over £. Then for any k > 0, P(w) = 
(0,0) mod k if and only if w is an abelian k-power. 

Proof. Let w be balanced and P(w ) = (fcs, fcf), for a positive integer k and some 
s, t > 0. Then we can write w = V\V 2 ■ ■ • Vk where each t>,; has length s + t. Now, 
each Vi must have Parikh vector equal to (s, t) otherwise w would not be balanced, 
whence the only if part of the statement follows. The if part is straightforward. 

□ 

A binary infinite word is Sturmian if and only if it is balanced and aperiodic. 
Sturmian words are precisely the infinite words having n + 1 distinct factors of 
length n for every n > 0. There is a lot of other equivalent definitions of Sturmian 
words. A classical reference on Sturmian words is Ha Chapter 2]. Let us recall 
here the definition of Sturmian words as codings of a rotation. 

We fix the torus I = R/Z = [0,1). Given a, ft in /, if a > fi, we use the 
notation [a, (3) for the interval [a, 1) U [0,/J). Recall that given a real number a, 
|_aj is the greatest integer smaller than or equal to a, [a] is the smallest integer 
greater than or equal to a, and {a} = a — |_aj is the fractional part of a. Notice 
that {—a} = 1 — {a}. 

Let a £ I be irrational, and p £ I. The Sturmian word s ajP (resp. s' a ) of 
angle a and initial point p is the infinite word s ajP = aoaia 2 • • • defined by 

_ j b if {p + na} € /b, 

\ a if {p + na} £ / a , 

where I b = [0,1 — a) and / a = [1 — a, 1) (resp. I b = (0,1 — a] and / a = (1 — a, 1]). 

In other words, take the unitary circle and consider a point initially in position 
p. Then start rotating this point on the circle (clockwise) of an angle a, 2a, 3a, 
etc. For each rotation, take the letter a or b associated with the interval within 
which the point falls. The infinite sequence obtained in this way is the Sturmian 
word s a , P (or s' a p , depending on the choice of the two intervals). See Fig. [I] for 
an illustration. 

For example, if <p = (l + -\/5)/2 ~ 1.618 is the golden ratio, the Sturmian word 

F = s v - 1,^-1 = abaababaabaababaababaabaababaabaab 
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Fig. 1. The rotation of angle a = tp — 1 ss 0.618 and initial point p = a generating the 
Fibonacci word F = s v - i, v -i = abaababaabaabab 


is called the Fibonacci word: 

A Sturmian word for which p = a, like the Fibonacci word, is called charac¬ 
teristic. Note that for every a one has s aj o = bs a)Q and s' a0 = a s a>a . 

An equivalent way to see the coding of a rotation consists in fixing the point 
and rotating the intervals. In this representation, the interval It, = I£ is rotated 
at each step, so that after i rotations it is transformed into the interval Iff 1 = 
[{-*«}, {-(« + l)a}), while /“* = I\ I~\ 

This representation is convenient since one can read within it not only a Stur¬ 
mian word but also any of its factors. More precisely, for every positive integer n, 
the factor of length n of Sq, iP starting at position j > 0 is determined by the value 
of {p + ja} only. Indeed, for every j and i, we have: 

fbif {p + ja}& V; 

\aif {p + ja}el~\ 

As a consequence, we have that given a Sturmian word s QjP and a positive integer 
n, the n + 1 different factors of s ajP of length n are completely determined by 
the intervals 1 ,... ,/ b ^" _1 \ that is, only by the points {— ia}, 0 < i < n. 

In particular, they do not depend on p , so that the set of factors of s aiP is the 
same as the set of factors of s ajP ' for any p and p'. Hence, from now on, we let s a 
denote any Sturmian word of angle a. 

If we arrange the n+2 points 0,1, {—a}, {—2a},..., {—na} in increasing order, 
we determine a partition of I in n+ 1 subintervals, Lq(ji), Li(n),..., L n (n). Each 
of these subintervals is in bijection with a different factor of length n of any 
Sturmian word of angle a (see Fig. [2]). 

Recall that a factor of length n of a Sturmian word s a has a Parikh vector equal 
either to (\na\,n— [na\) (in which case it is called light ) or to ([ ~na\,n— [na]) 
(in which case it is called heavy). The following proposition relates the intervals 
Li(n ) to the Parikh vectors of the factors of length n (see [9] and m)- 

Proposition 3. Let s a be a Sturmian word of angle a, and n a positive integer. 
Let ti be the factor of length n associated with the interval Li ( n ). Then L is heavy 
if Li(n) C [{—na},l), while it is light if Li(n) C [0, {—na}). 
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Fig. 2. The points 0, 1 and {—a}, {—2a}, {—3a}, {—4a}, {—5a}, {—6a}, arranged in 
increasing order, define the intervals Lo(6) « [0,0.146), Li(6) « [0.146,0.292), 1/2(6) ~ 
[0.292,0.382), L 3 (6) « [0.382,0.528), L 4 (6) « [0.528,0.764), L s (6) « [0.764,0.910), 
Lq{ 6) ss [0.910,1). Each interval is associated with one of the factors of length 6 of the 
Fibonacci word, respectively babaab, baabab, baabaa, ababaa, abaaba, aababa, aabaab. 


Example 1. Let a = ip — 1 « 0.618 and n = 6. We have 6a ss 3.708, so that 
{—6a} ss 0.292. The reader can see in Fig. [2] that the factors of length 6 corre¬ 
sponding to intervals above (resp. below) {—6a} ss 0.292 all have Parikh vector 
(4,2) (resp. (3,3)). That is, the intervals L 0 and L\ are associated with light 
factors (babaab, baabab), while the intervals L 2 to L§ are associated with heavy 
factors (baabaa, ababaa, abaaba, aababa, aabaab). 

Observe that, by Lemma[4j every factor of a Sturmian word having even length 
and containing an even number of a’s (or, equivalently, of b’s) is an abelian square. 
The following proposition relates the abelian square factors of a Sturmian word 
of angle a with the arithmetic properties of a. 

Proposition 4. Let s a be a Sturmian word of angle a, and n a positive even 
integer. Let U be the factor of length n associated with the interval Li(n). Then 
L is an abelian square if and only if Li(n) C [{— na}, 1) if [na J is even, or 
Li{n ) C [0, {— na}) if [naj is odd. 

Proof. By Proposition [3j U is heavy if and only if Li(n) C [{—na}, 1), while it is 
light if and only if Li(n) C [0, {—na}). If [naj is even, then every light factor of 
length n contains an even number of a’s and hence is an abelian square, while if 
{na\ is odd, then every heavy factor of length n contains an even number of a’s 
and hence is an abelian square, whence the statement follows. □ 

Recall that given a finite or infinite word w, ASF w (n) denotes the number of 
abelian square factors of w of length n. 
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n 0 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32 34 36 


ASF F (n) 0 1 3 5 1 9 5 5 15 3 13 13 5 25 9 15 25 21 27 


Table 1 . The first values of the sequence ASFp(n) of the number of abelian square 
factors of length n in the Fibonacci word F = s v - 


Corollary 2 . Let s a be a Sturmian word of angle a. For every positive even n, 
let I n = {{— ia} | 1 < i <n}. Then 

ASF (n) - /^ X £ln \ x - */ V na \ is even i 

1 #{x £ I n \ x > {—na}} if [na J is odd. 

Example 2. The factors of length 6 of the Fibonacci word F are, lexico¬ 
graphically ordered: aabaab, aababa, abaaba, ababaa, baabaa (heavy factors), 
baabab, babaab (light factors). The light factors, whose number of a’s is [6aJ = 3, 
are not abelian squares; the heavy factors, whose number of a’s is [6a] = 4, are 
all abelian squares. 

We have I e = {0.382,0.764,0.146,0.528,0.910,0.292} (values are approxi¬ 
mated) and 6a ~ 3.708, so [6aJ is odd. Thus, there are 5 elements in J 6 that 
are > {—6a}, so by Corollary [2] there are 5 abelian square factors of length 6. 

The factors of length 8 of the Fibonacci word are, lexicographically 
ordered: aabaabab, aababaab, abaabaab, abaababa, ababaaba, baabaaba. 
baababaa. babaabaa (heavy factors), babaabab (light factor). The light fac¬ 
tor, whose number of a’s is [8aJ = 4, is an abelian square; the heavy fac¬ 
tors, whose number of a’s is [8a] = 5, are not abelian squares. We have 
Is = {0.382,0.764,0.146,0.528,0.910,0.292,0.674,0.056} (values are approxi¬ 
mated) and 8a ~ 4.944, so [8a] is even. Thus, there is only one element in 
I s that is < {8a}, so by Corollary [2] there is only one abelian square factor of 
length 8. 

In Table[l]we report the first values of the sequence ASFp{n ) for the Fibonacci 
word F. 


Recall that every irrational number a can be uniquely written as a (simple) 
continued fraction as follows: 


1 

a — cio 4-i— 

CL I + -j- 

1 a2 + ... 


(i) 


where ao — [a], and the infinite sequence (uj)i>o is called the sequence of partial 
quotients of a. The continued fraction expansion of a is usually denoted by its 
sequence of partial quotients as follows: a = [do; di, d 2 ,...], and each its finite 
truncation [do; di, d 2 ,..., a*,] is a rational number nk/mk called the Ath conver¬ 
gent to a. We say that an irrational a = [do; di, U 2 ,...] has bounded partial 
quotients if and only if the sequence (dj)j >0 is bounded. 

The development in continued fraction of a is deeply related to the exponent 
of the factors of the Sturmian word s a . Recall that an infinite word w is said to 
be /3-power free, for some /3 > 2, if for every factor v of w, the ratio between 
the length of v and its minimal period is smaller than /3. The second author m 
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proved that a Sturmian word of angle a is /3-power free for some /3 > 2 if and 
only if a has bounded partial quotients. 

Since the golden ratio ip is defined by the equation ip = 1 + l/<p, we have from 
Equation [I] that ip = [1; 1,1,1,1,...] and therefore tp — 1 = [0; 1,1,1,1,...], so 
the Fibonacci word is an example of /3-power free Sturmian word (actually, it is 
(2 + v?)-power free [T5]V 

We are now proving that if a has bounded partial quotients, then the Sturmian 
word s a is abelian-square rich. For this, we will use a result on the discrepancy of 
uniformly distributed modulo 1 sequences from m- To the best of our knowledge, 
this is the first application of this result to the theory of Sturmian words, and we 
think that the correspondence we are now showing might be useful for deriving 
other results on Sturmian words. 

Let u> = (x n ) n >o be a given sequence of real numbers. For a positive integer 
N and a subset E of the torus /, we define A(E; N;ui) as the number of terms 
x n , 0 < n < N, for which {x n } £ E. If there is no risk of confusion, we will write 
A(E;N) instead of A(E; N;lo). 


Definition 3. The sequence uj = ( x n ) n >o of real numbers is said to be uniformly 
distributed modulo 1 if and only if for every pair a , b of real numbers with 0 < a < 
b < 1 we have 


lim 

N—too 


A{[a, b); N ; u>) 
N 


= b — a. 


Definition 4. Letx 0 , 2 : 1 , ...,xn be a finite sequence of real numbers. The number 


D n = D n (x o, x \,..., xn) 


sup 

0<7<«5<! 


N 


( 5 - 7 ) 


is called the discrepancy of the given sequence. For an infinite sequence oj of real 
numbers the discrepancy Dn(uj) is the discrepancy of the initial segment formed 
by the first N + 1 terms of to. 


The two previous definitions are related by the following result. 

Theorem 1 ( |l6j ). The sequence to is uniformly distributed modulo 1 if and only 
if linqv-^oo D n (uj) = 0. 

An important class of uniformly distributed modulo I sequences is given by the 
sequence (na)„>o with a a given irrational number and n £ N. The discrepancy 
of the sequence ( na ) will depend on the finer arithmetical properties of a. In 
particular, we have the following theorem, stating that if a has bounded partial 
quotients, then its discrepancy has the least order of magnitude possible. 

Theorem 2 ( |l6j f. Suppose the irrational a = [ao;ai,...] has partial quotients 
bounded by K. Then the discrepancy D^{uj) of u> = (na) satisfies NDn(ui) = 
O(logiV). More exactly, we have 

ND ^^ 3+ {^M^v)' oeN - (2) 

We are now using previous definitions and results to prove that /3-power free 
Sturmian words are abelian-square rich. 
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Theorem 3. Let s a be a Sturmian word of angle a such that a has bounded 
partial quotients. Then there exists a positive constant C such that for every n 
sufficiently large one has Jf m<n ASF Sa (m) > Cn 2 . 

Proof. For every even n, let I' n = {{za} \ 1 < i < n}. By Corollary [2] and basic 
arithmetical properties of the fractional part, we have: 


ASF Sa (n) = 


ff{x £ I' n | x > {na}} if \na J is even; 
#{x £ I' n | x < {na}} if [naj is odd. 


So: 


X] ASF Sa (m) (3) 

m<n 

> ^ #{{za} | {za} < 1/2, i < m, and {ma} < 1/2, |_maj even } (4) 

m<n 

> ^ #{{za/2} | {za/2} £ [1/4,1/2), i < m, and {ma/2} < 1/4} (5) 

m<n 

> ^ #{{za/2} | {za/2} £ [1/4,1/2 ),i < n/2, and {ma/2} < 1/4} ( 6 ) 

n/2<m<n 

= #{{za/2} | {za/2} £ [1/4,1/2), * < n/2} x ^ {m\ {ma/2} < 1/4} (7) 

n/2<m<n 

where: (4) follows from (3) by Corollary[2j (5) follows from (4) because {ma/2} < 
1/4 implies {ma} < 1/2 and [maj is even if and only if {?na/2} < 1/2; ( 6 ) follows 
from (5) is obvious; finally (7) follows from ( 6 ) because the cardinality of the first 
set is independent from the sum. 

Now, a/2 has bounded partial quotients (since a has) and we can apply The¬ 
orem [2] to evaluate the two factors of (7). So we have: 

#{{za/2} | {za/2} £ [1/4,1/2 ),i < n/2} 

= A([l/4,l/2);n/2;(na/2)) 

> (1/2 — l/4)n/2 — Ci logn 
= n /8 — Ci logn, 

for n sufficiently large and a positive constant C\. We also have: 
y {m | {ma/2} < 1/4} 

n/2<m<n 

= x4([0,1/4); n; (na/2)) - A([0,1/4); n/2; (na/2)) 

> n/4 — C 2 log n — n /8 — C 3 logzz 
= n /8 — C 4 log n, 

for n sufficiently large and positive constants C 2 , C 3 , C 4 . The product of the two 
factors of ( 6 ) is therefore greater than a constant times n 2 , as required. □ 

The recurrence quotient r a of a Sturmian word of angle a = [0; aq, 02 ,...] 
such that a has bounded partial quotients verifies 2 + r a < lim sup at <3 + r a 
[2] Prop. 5]. Moreover, Durand [7( proved that a Sturmian word of angle a is 
linearly recurrent if and only if a has bounded partial quotients. Thus, we have 
the following: 


Words with the Maximum Number of Abelian Squares 
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Corollary 3. Let s a be a Sturmian word of angle a. If s a is f3-power free, then 
s a is uniformly abelian-square rich. 

Proof. We known that s a is /3-power free for some /3 > 2 if and only if a has 
bounded partial quotients if and only if s a is linearly recurrent. The statement 
then follows from Theorem [3] and Lemmas Q] and [2j □ 


4 Conclusions and future work 

We proved that the Thue-Morse is uniformly abelian-square rich. We think that 
the technique we used for the proof can be generalized to some extent, and could 
be used, for example, to prove that a class of fixed points of uniform substitutions 
are uniformly abelian-square rich. 

We also proved that Sturmian words that are /3-power free for some /3 > 2 are 
uniformly abelian-square rich. The proof we gave is based on a classical result on 
the discrepancy of the uniformly distributed modulo 1 sequence (na) n > q, where 
a is the slope of the Sturmian word. To the best of our knowledge, this is the 
first application of this result to the theory of Sturmian words, and we think that 
the correspondence we have shown might be useful for deriving other results on 
Sturmian words. 

The natural question that then arises is whether the hypothesis of power free¬ 
ness is necessary for a Sturmian word being (uniformly) abelian-square rich. We 
leave open the question to determine whether s a is not uniformly abelian-square 
rich nor abelian-square rich in the case when a has unbounded partial quotients. 

We mostly investigated binary words in this paper. We conjecture that bi¬ 
nary words have the largest number of abelian square factors. More precisely, we 
propose the following conjecture. 

Conjecture 1. If a word of length n contains k many distinct abelian square fac¬ 
tors, then there exists a binary word of length n containing at least k many distinct 
abelian square factors. 

A slightly different point of view from the one we considered in this paper 
consists in identifying two abelian squares if they have the same Parikh vector. 
Two abelian squares are therefore called inequivalent if they have different Parikh 
vectors m Sturmian words only have a linear number of inequivalent abelian 
squares. Nevertheless, a word of length n can contain 0(n^/n) inequivalent abelian 
squares m- Computations support the following conjecture: 

Conjecture 2 (see M)- A word of length n contains 0{n^/n) inequivalent abelian 
squares. 
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